Russian Stress Prediction using Maximum Entropy Ranking
نویسندگان
چکیده
We explore a model of stress prediction in Russian using a combination of local contextual features and linguisticallymotivated features associated with the word’s stem and suffix. We frame this as a ranking problem, where the objective is to rank the pronunciation with the correct stress above those with incorrect stress. We train our models using a simple Maximum Entropy ranking framework allowing for efficient prediction. An empirical evaluation shows that a model combining the local contextual features and the linguistically-motivated non-local features performs best in identifying both primary and secondary stress.
منابع مشابه
Applications of maximum entropy rankers to problems in spoken language processing
We report on two applications of Maximum Entropy-based ranking models to problems of relevance to automatic speech recognition and text-to-speech synthesis. The first is stress prediction in Russian, a language with notoriously complex morphology and stress rules. The second is the classification of alphabetic non-standard words, which may be read as words (NATO), as letter sequences USA, or as...
متن کاملPrediction of potential habitats of Astracantha gossypina (Fisch.) Using the maximum entropy model in regional scale
Astracantha gossypina (Fisch.) is one of the most important rangeland plants in the northeastern region of Iran and has a great role in soil conservation and the economy of ranchers. unreasonable uses and management of desire plant species usually led to species loss and the replacement of endemic and specialist species by invasive endemic or exotic species. This study was conducted to determin...
متن کاملMaximum Entropy Models for Realization Ranking
In this paper we describe and evaluate different statistical models for the task of realization ranking, i.e. the problem of discriminating between competing surface realizations generated for a given input semantics. Three models are trained and tested; an n-gram language model, a discriminative maximum entropy model using structural features, and a combination of these two. Our realization co...
متن کاملMachine Transliteration Using Multiple Transliteration Engines and Hypothesis Re-Ranking
This paper describes a novel method of improving machine transliteration by using multiple transliteration hypotheses and re-ranking them. We constructed seven machine-transliteration engines to produce a set of transliteration hypotheses. We then re-ranked the hypotheses to select the correct transliteration hypothesis. We propose a re-ranking method that makes use of confidence-score, languag...
متن کاملStatistical Ranking in Tactical Generation
In this paper we describe and evaluate several statistical models for the task of realization ranking, i.e. the problem of discriminating between competing surface realizations generated for a given input semantics. Three models (and several variants) are trained and tested: an n-gram language model, a discriminative maximum entropy model using structural information (and incorporating the lang...
متن کامل